
RapidRAID: Pipelined Erasure Codes for Fast Data Archival in Distributed Storage Systems



Abstract

To achieve reliability in distributed storage systems, data has usually been replicated across different nodes. However, the increasing volume of data to be stored has motivated the introduction of erasure codes, a storage-efficient alternative to replication, particularly suited for archival in data centers, where old datasets (rarely accessed) can be erasure encoded, while replicas are maintained only for the latest data. Many recent works consider the design of new storage-centric erasure codes for improved repairability. In contrast, this paper addresses the migration from replication to encoding: traditionally, erasure coding is an atomic operation in that a single node with the whole object encodes and uploads all the encoded pieces. Although large datasets can be concurrently archived by distributing individual object encodings among different nodes, the network and computing capacity of individual nodes constrain the archival process due to such atomicity. We propose a new pipelined coding strategy that distributes the network and computing load of single-object encodings among different nodes, which also speeds up multiple object archival. We further present RapidRAID codes, an explicit family of pipelined erasure codes which provides fast archival without compromising either data reliability or storage overheads. Finally, we provide a real implementation of RapidRAID codes and benchmark its performance using both a cluster of 50 nodes and a set of Amazon EC2 instances. Experiments show that RapidRAID codes reduce a single object's coding time by up to 90%, while when multiple objects are encoded concurrently, the reduction is up to 20%.
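To make the contrast between atomic and pipelined archival concrete, the sketch below simulates both strategies in plain Python: atomic encoding, where one node computes every coded piece, versus a pipeline in which each node folds its locally stored block into a partial coded piece and forwards it to the next node. This is only an illustration of the pipelining idea described in the abstract; the GF(2^8) arithmetic, the Vandermonde coefficients, and all function names are assumptions made for this sketch, not the actual RapidRAID construction from the paper.

```python
import os

GF_POLY = 0x11D  # a common reducing polynomial for GF(2^8), e.g. in Reed-Solomon codes

def gf_mul(a: int, b: int) -> int:
    """Multiply two field elements (bytes) in GF(2^8)."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= GF_POLY
        b >>= 1
    return r

def gf_pow(a: int, e: int) -> int:
    """Raise a field element to a non-negative integer power."""
    r = 1
    for _ in range(e):
        r = gf_mul(r, a)
    return r

def scale_xor(acc: bytes, block: bytes, coeff: int) -> bytes:
    """Return acc + coeff * block, element-wise over GF(2^8)."""
    return bytes(a ^ gf_mul(coeff, b) for a, b in zip(acc, block))

def atomic_encode(blocks: list[bytes], coeffs: list[list[int]]) -> list[bytes]:
    """Atomic archival: a single node holds all k blocks, computes every coded
    piece itself, and uploads them all, bearing the full CPU and network load."""
    size = len(blocks[0])
    pieces = []
    for row in coeffs:                      # one coded piece per coefficient row
        acc = bytes(size)
        for coeff, block in zip(row, blocks):
            acc = scale_xor(acc, block, coeff)
        pieces.append(acc)
    return pieces

def node_step(partial: bytes, local_block: bytes, coeff: int) -> bytes:
    """Work done by ONE node in the pipeline: fold the locally stored block
    into the partial piece received from the previous node, then forward it."""
    return scale_xor(partial, local_block, coeff)

def pipelined_encode(blocks: list[bytes], coeffs: list[list[int]]) -> list[bytes]:
    """Pipelined archival sketch: block j lives on node j (e.g. as a replica).
    Each iteration of the inner loop would run on a different node, so the
    multiplications and transfers for one object's archival are spread over
    k machines instead of being concentrated on a single encoder."""
    size = len(blocks[0])
    pieces = []
    for row in coeffs:
        partial = bytes(size)               # the first node starts from zero
        for coeff, local_block in zip(row, blocks):
            partial = node_step(partial, local_block, coeff)
        pieces.append(partial)              # the last node stores the finished piece
    return pieces

if __name__ == "__main__":
    k, n = 4, 6                                              # illustrative code parameters
    blocks = [os.urandom(1024) for _ in range(k)]            # k data blocks of one object
    coeffs = [[gf_pow(i + 1, j) for j in range(k)] for i in range(n)]  # Vandermonde rows
    assert atomic_encode(blocks, coeffs) == pipelined_encode(blocks, coeffs)
    print(f"atomic and pipelined strategies produce the same {n} coded pieces")
```

Both functions compute the same codeword; the point of the sketch is where the work happens. In the pipelined variant each `node_step` call stands for a different machine, which is how the strategy spreads the encoding load and speeds up archival.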
